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Abstract 



A classical and widely used lemma of Erdos and Szekeres asserts that for every n there 
exists N such that every n-term sequence a of real numbers contains an n-term increasing 
i-^ subsequence or an n-term nondecreasing subsequence; quantitatively, the smallest N with 

this property equals (rt — 1)^ + 1. In the setting of the present paper, we express this 
^ lemma by saying that the set of predicates $ = {xi < X2,xi > X2} is Erdos-Szekeres 

i_i with Ramsey function ES#(n) = (n — 1)^ + 1. 

In general, we consider an arbitrary finite set $ = {$1, . . . , of semialgebraic 
predicates, meaning that each $j = ^j{xi, . . . , Xk) is a Boolean combination of polynomial 
equations and inequalities in some number k of real variables. We define $ to be Erdos- 
Szekeres if for every n there exists N such that each iV-term sequence a of real numbers 
t — ■ has an n-term subsequence b such that at least one of the holds everywhere on b, which 

means that , . . . ,bi^) holds for every choice of indices 11,12, ■■■ ,ik, 1 < h < < 

r — ■ ■ ■ < ik < n. We write ES*(n) for the smallest TV with the above property. 

We prove two main results. First, the Ramsey functions in this setting are at most 
doubly exponential (and sometimes they are indeed doubly exponential): for every # 
• • that is Erdos-Szekeres, there is a constant C such that ES#(n) < 2^ . Second, there is 

an algorithm that, given decides whether it is Erdos-Szekeres; thus, one-dimensional 
Erdos-Szekeres-style theorems can in principle be proved automatically. 

We regard these results as a starting point in investigating analogous questions for d- 
dimensional predicates, where instead of sequences of real numbers, we consider sequences 
of points in M'' (and semialgebraic predicates in their coordinates). This setting includes 
many results and problems in geometric Ramsey theory, and it appears considerably 
more involved. Here we prove a decidability result for algebraic predicates in K'' (i-e., 
conjunctions of polynomial equations), as well as for a multipartite version of the problem 
with arbitrary semialgebraic predicates in M''. 



1 Introduction 

Motivation and background. Ramsey-type theorems claim that, generally speaking, 
any sufficiently large structure of a given kind contains a "very regular" substructure of a 
prescribed size. The following two gems from a 1935 paper of Erdos and Szekeres [7] belong 
among the earliest, best known, and most useful instances. 

*Both authors were supported by the ERG Advanced Grant No. 267165. 
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Theorem 1.1 (On monotone subsequences). For every n >2, every sequence (oi, 02, • • • , aat) 
of real numbers, with N > (n — 1)^ + 1, contains a monotone subsequence of length n; more 
precisely, there are indices ii < 12 < • • • < in such that either Oi^ < ■ ■ ■ < ai^ or ai^> ■ ■ ■ > 

See, for example, Steele [17] for a collection of six nice proofs and many applications. 

Theorem 1.2 (On subsets in convex position). For every n > 3, among every N = i^^Z2) + 
1 PS 4"/-^/n points in the plane, no three collinear, one can always select n points in convex 
position (i.e., forming the vertex sets of a convex n-gon). 

See, e.g., [12, 10] for proofs and surveys of developments around this result. 

Many geometric Ramsey-type questions have been investigated in the literature; here we 
mention just three examples that directly motivated our research. 

The colored Tverberg theorem, conjectured by Barany, Fiiredi, and Lovasz [1] and proved 
by Vrecica and Zivaljevic [20] asserts that for every d and r there exists t such that if 
Ai, . . . , Ad+i are t-point sets in (we imagine that each Ai has its own color; e.g., the points 
of Ai are red, those of A2 blue, etc.), there are r-point subsets Bi C Ai, i = 1,2, . . . ,d + \ and 
a point X G M'^ that lies in the convex hull of {bi, . . . , bf^+i} for every choice 0/ bi € -Bi,. . . , 
b(i_|_i € Bci+i. For us, this result is remarkable because all known proofs use topological 
methods — there is no known proof by "ordinary" geometric and/or combinatorial arguments 
(although some special cases do have elementary proofs). 

The following result was needed as a lemma in the paper [3] by Loh, Nivasch, and the 
first author: every sequence (ai,a2, . . . ^sln) of points in with increasing x-coordinates has 
a subsequence (bi,...,b„) in which every 7-tuple bj^,...,bj7, ii < ■■■ < ij, is as in the 
next picture, i.e., the triangle bjjbj^bjg contains the intersection of the segments bj^bj^ and 
bjgbj^ — provided that N is sufficiently large in terms of n. 




While the proof in [3] is simple, the situation with an appropriate d-dimensional generalization 
is discouraging: there are increasingly complicated proofs up to dimension 4, while the 5-di- 
mensional case already seems out of reach with the present methods. 

Yet another example we want to mention is a Ramsey-type result of Elias and the second 
author [6]: every N -point sequence (ai,...,ajv) of points in the plane with increasing x- 
coordinates has an n-term subsequence that is kth order monotone, meaning that either every 
(k + l)-tuple of points lies on the graph of a smooth function with nonnegative kth derivative, 
or every {k + l)-tuple lies on the graph of a smooth function with nonpositive kth derivative. 
(This is a common generalization of Theorems 1.1 and 1.2.) Here N depends on n and k; 
the existence of some N follows immediately from Ramsey's theorem for (fc + l)-tuples, but 
an interesting question here is the behavior of the Ramsey function — how big is the smallest 
N = Nk{n) that works. We have A^i(n) ~ and N2{n) ~ 4" according to Erdos and Szekeres, 
and in [6] it was proved that N'i{n) is doubly exponential, i.e., 2^"^" < N^{n) < 2'^"^" . The 
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order of magnitude of A^4(n) is unknown, and so are the Ramsey functions for numerous other 
geometric Ramsey-type results. 

Erdos— Szekeres predicates. The above examples and some others motivate (at least) 
three general questions formulated below. Before stating them, we introduce some notions 
and notation. 

Let k be an integer, which we think of as small and fixed, and let $ = <I'(xi, . . . , x^) be a 
d-dimensional fc-ary predicate, by which we mean a mapping (M'^)'^ — ?• {False, True}. We say 
that <I> holds everywhere on a sequence a = (ai, . . . ,an) of points in M'^ if <I>(aj^,aj2, • . . 
holds for every increasing A;-tuple ii, . . . ,ik of indices, 1 < ii < ^2 • • • < ifc < n. 

Definition 1.3. Let ^ be a set of d- dimensional predicates. We say that $ is Erdos-Szekeres 
if for every n there exists N such that every sequence a of length N has subsequence b of 
length n such that there is a predicate <I> G $ that holds everywhere on b. Let ES^(n) denote 
corresponding Ramsey function, i.e. the smallest N with the property above. 

For example, Theorem 1.1 on monotone subsequences can be re-stated as follows in this 
language: the set $ = {xi < X2,xi > X2} of 1-dimensional predicates is Erdos-Szekeres, 
with ES$(n) = (n — 1)^ — 1 (note that we do not write points in M^, i.e., real numbers, in 
boldface — unlike points in M'^). 

For Theorem 1.2 on subsets in convex position, we can let $ consist of a single 4-ary 
predicate <I>conv(xi, . . . ,X4) expressing that the 4-tuple xi, . . . , X4 is in convex position (with 
appropriate handling of collinearities). Indeed, a set is in convex position if each of its 
4-tuples is. Alternatively, we can set := {<&pos, "^'neg}; where $pos(xi, X2, X3) expresses 
that the ordered triple (xi,X2,X3) has a positive orientation, while $neg expresses negative 
orientation. 

Let us remark that the notion of Erdos-Szekeres set of predicates is not general enough to 
express the colored Tverberg theorem mentioned above, for example; however, it could easily 
be extended to a colored setting, if that proved useful.^ 

Another thing worth noting here is that given a finite set $ of predicates, it is possible to 
produce a single predicate $ such that $ is Erdos-Szekeres iff is. Moreover, the functions 
ES^ and ES{$} can be related; see Section 8 for a discussion. However, the passage from $ 
to $ increases the number of variables and produces a rather cumbersome predicate while 
treating a set of predicates in our development is not much more complicated than treating 
a single one, so we prefer dealing with sets of predicates. 

The predicates in Theorems 1.1 and 1.2, as well as in the other examples above and 
elsewhere in geometric Ramsey theory, can be represented as semialgebraic predicates, where 
a d-dimensional k-ary semialgebraic predicate $ = $(xi, . . . ,Xfc) is a Boolean combination 
of polynomial equations and inequalities in the coordinates of the k points xi , . . . , x/; E M*^ 
(the polynomials are assumed to have rational coefficients). In this paper we consider only 
semialgebraic predicates. 

We consider the following general questions quite natural and very interesting. 

^Perhaps more seriously, there are results and problems in geometric Ramsey theory that do not fit this 
kind of framework at all. For example, the famous recent 6-hole theorem of Gerken [8] and Nicolas [13] asserts 
that every sufficiently large point set P C in general position contains an empty hexagon, i.e., six points in 
convex position whose convex hull contains no other point of P. Here the problem is that this is not a "pure" 
Ramsey-type result, since, although we pass to a subset, the original points still play a role as "obstacles" . We 
suspect general problems of this kind to be substantially harder than those considered in this paper. 
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1. (Ramsey function) What is the largest possible order of magnitude o/ES^, where $ is 
a finite set of d- dimensional k-ary semialgebraic predicates? 

A simple upper bound follows from Ramsey's theorem. For $ = {$i, . . . , ^m} and a 
sequence a of length N, we color a A:-tuple / C {1, 2, . . . , A^} with color i £ {1, 2, . . . , m} 
if $j holds for the fc-tuple in a indexed by / (if there are several possibilities, we pick 
one arbitrarily), The color m + 1 is used for the fc-tuples on which no holds. By 
Ramsey's theorem, if > Rk{n; m + 1) (the Ramsey number for fc-tuples with m + 1 
colors), a contains a subsequence of length n in which all /c-tuples have the same color. 
If $ is Erdos-Szekeres, then for n sufficiently large, color m + 1 is impossible. Thus, 
we get ES$(n) < Rk{n; m + 1) for all sufficiently large n, and Rk{n; m + 1) is bounded 
above by a tower function of height k — see, e.g., [9]. 

2. (Decidability) Is there an algorithm that, given a finite set $ of d- dimensional semial- 
gebraic predicates, decides whether $ is Erdds-Szekeres? 

By a celebrated result of Tarski [18], the first-order theory of the reals is decidable. 
That is, there is an algorithm deciding the validity of formulas of the form 

{Qixi) . . . {QkXk)^{xi, . . . , Xfc), 

where Qi, ■ ■ ■ ,Qk G {V, 3} are quantifiers for the real variables xi, . . . ,Xk and $ is a 
semialgebraic predicate in our sense. This is one of the most useful decidability results, 
and there is an extensive area studying efficient algorithms for this problem and various 
special cases (see, e.g., [2]). This may give some hope for decidability of the Erdos- 
Szekeres property, and quantifier-elimination methods for the first-order theory of the 
reals play a significant role in the present paper. 

3. (Homogeneous, or indiscernible, sequences) What can be said about very long, or in- 
finite, point sequences that are homogeneous w.r.t. interesting classes of semialgebraic 
predicates? 

To explain this question, let us first consider all 2-dimensional semialgebraic predicates 
$ that depend only on the orientation of triples of points; for example, "xi lies in 
the convex hull of X2,X3,X3". Then every point sequence C in convex position (i.e., 
the points are in convex position and numbered along the circumference of the convex 
hull either clockwise or counterclockwise) is homogeneous for this class of predicates, 
meaning that each A:-ary such predicate <I> either holds on all fc-tuples of C or holds 
on none of them. Moreover, as far as these predicates are concerned, all sequences in 
convex position look the same (except for two possible orientations, that is). 

However, some natural Ramsey-type questions, such as the colored Tverberg problem or 
the question with triangles containing segment intersections illustrated by the picture 
above, need "higher-order" geometric predicates, such as whether the intersection of 
two lines, each determined a pair of the considered points, lies above or below the line 
spanned by another pair of points. (Semialgebraic predicates of course include this kind 
of predicates and much more.) Here it is far from obvious what homogeneous sequences 
for such predicates might look like. Let us remark that one kind of a homogeneous 
set for semialgebraic predicates of bounded complexity, called the stretched diagonal, 
was used as an interesting example in the recent papers [4, 5], together with a related 
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construction of a stretched grid — this is yet another motivation of our investigations 
here. 

Answers for dimension 1. Luckily, after investigating this kind of problems for some 
time, we found the (unpublished) thesis of Rosenthal [16] which, in the language of logic 
and model theory, provided an answer to the third question (homogeneous sets) for d = 1. 
With the help of Rosenthal's results and methods, together with other techniques, we then 
succeeded in answering the first two questions for d = 1. 

We were surprised by the answer to the first question: it turned out that in dimension 1, 
the Ramsey functions can be at most doubly exponential, independent of the arity of the 
predicates. 

Theorem 1.4. For every finite Erdos-Szekeres set $ of 1 -dimensional predicates there exists 
a number C = C($) such that 

ES*(n) < 2^^". 
The proof is presented in Sections 2-4. 

In Section 7 we will show that some predicates indeed require a doubly exponential bound. 

Proposition 1.5. There is a finite Erdos-Szekeres set $ of predicates with ES^(n) > 2'^"", 
with a suitable constant c > 0. 

The proof of Theorem 1.4 also immediately provides a quantitative result for homogeneous 
subsequences for every semialgebraic predicate, Erdos-Szekeres or not. 

Proposition 1.6. For every finite set $ of 1-dimensional semialgebraic predicates there is 
a constant C = C($) such that for every integer n, every sequence a of length 2^ " contains 
an n-term subsequence b such that for each predicate G $ holds either everywhere on b or 
nowhere on b. 

By adding some more ingredients to the proof of Theorem 1.4 in Sections 5 and 6, we will 
also prove that 1-dimensional Erdos-Szekeres predicates can be recognized algorithmically. 

Theorem 1.7. There is an algorithm that, given a finite set $ of 1-dimensional semialgebraic 
predicates, decides whether it is Erdos-Szekeres. 

Higher dimensions: algebraic predicates and the multipartite case. For d > 2, all 

of the three questions above appear much harder than for d = 1, and at the time of writing 
this paper we have just some preliminary results for d = 2. 

Here we provide decidability results with d arbitrary for a restricted class of predicates, 
as well as for general semialgebraic predicates but with a different Ramsey-type question. 

Let us define an algebraic predicate, a special kind of a semialgebraic predicate, as a d- 
dimensional predicate ^*(xi, . . . ,Xfc) expressible as a conjunction of polynomial equations in 
the coordinates of xi, . . . ,Xfc (with rational coefficients). We note that while a semialgebraic 
predicate defines a semialgebraic set (in (M'^)^), an algebraic predicate defines an algebraic 
variety. 

For algebraic predicates the question of being Erdos-Szekeres does not make much sense, 
since every nontrivial algebraic predicate fails on a generic /c-tuple of points. However, the 
question of whether there exist arbitrarily long point sequences, or even infinite ones, on 
which a given algebraic predicate holds everywhere, is meaningful. The following theorem 
shows that it is decidable. 
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Theorem 1.8 ("Effective compactness" for algebraic predicates). For every d, D, and k 
there exists N, for which an explicit bound can be given, such that for every d- dimensional 
k-ary algebraic predicate in which all the polynomials have degree at most D, the following 
two conditions are equivalent: 

(i) There exists a sequence a. of N points in on which $ holds everywhere. 

(ii) There exists an infinite sequence a of points in on which ^ holds everywhere. 

Since condition (i) can be tested using a decision algorithm for the first-order theory 
of the reals, we obtain a decision algorithm for testing the existence of infinite sequences, 
or equivalently, of arbitrarily long sequences, on which a given algebraic predicate holds 
everywhere. 

Let us remark that the statement of the theorem also holds with the real numbers replaced 
by the complex numbers. 

We note that a statement analogous to Theorem 1.8 fails badly for semialgebraic predi- 
cates. Indeed, the 1-dimensional binary predicate X2) := (xi > 0) A(x2 > xi + 1) A(x2 < 
2xi) holds everywhere on arbitrarily long finite sequences, such as (n, n + 1, . . . , 2n), but on 
no infinite sequence. Similarly, (xi > 0) A (x2 > xi + 1) A (x2 < A), where ^4 is a constant, 
admits ^-term sequences but not longer. 

Next, we let $(xi, . . . ,x;c) be a d-dimensional semialgebraic predicate, but we ask a dif- 
ferent question. Let Ai,...,Ak be point sets in W^. We say that $ holds everywhere on 
AiX ■ ■ ■ X Ak'ii <I?(ai, . . . , a^) holds for every choice of points ai G ^1,. . . , a^ S A^. We have 
the following analog of Theorem 1.8. 

Theorem 1.9 ("Effective compactness" for the multipartite setting). For every d, D, and 
k there exists N , for which an explicit bound can be given, such that for every d-dimension- 
al k-ary semialgebraic predicate ^, in which all the polynomials have degree at most D, the 
following two conditions are equivalent: 

(i) There exist N -point sets Ai, . . . , A^ in such that $ holds everywhere on Aix - ■ - x A^. 

(ii) There exist infinite sets Xi, . . . , Xf^ in such that <I> holds everywhere on Xi x • • • x X^. 

Theorems 1.8 and 1.9 are proved in Section 9; the proofs are similar and more or less 
independent of the rest of the paper. 

2 Ramseying for fast-growing sequences 

Here we work in the 1-dimensional setting, i.e., with sequences of real numbers, and we begin 
with preparations for the proofs of Theorems 1.4 and 1.7. 

i?- growing sequences. The first idea in our approach is that if <^ is a 1-dimensional 
semialgebraic predicate and a is a sequence that grows sufficiently fast (where the speed of 
growth is quantified with respect to $), then either $ holds everywhere on a, or $ holds 
nowhere on a (by which we mean that the negation holds everywhere on o). Moreover, 
one can decide between these two possibilities "syntactically", just from the structure of 
without any information about a (except for a guarantee of the fast growth) . 

This will be expressed more precisely below, but first we formalize "growing sufficiently 
fast". 
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Definition 2.1. Let R > 2 be a real number. We call a sequence a = (oi, 02, • • • , R- 
growing if ai > R and Cj+i > a^, i = 1,2, . . . ,n — 1. 

Observation 2.2. For every 1- dimensional semialgebraic predicate $ = . . . ,Xk) there 

exists R > 1 such that either $ holds everywhere on every R-growing sequence a, or $ holds 
nowhere on every such a. Moreover, it is easy to decide by inspection of $ which of these 
cases hold (without explicit knowledge of R). 

Sketch of proof . First we note that if . . . , x^) is a polynomial and R is sufficiently large 
in terms of the degree and maximum absolute value of the coefficients of p, then the sign of 
p{ai-^,ai2, . . . , ajj,), ii < • • • < ifc, is given by the sign of the coefficient of the lexicographically 
largest monomial present in p{xi, . . . , x^) (where we compare monomials first according to 
the power of Xk, then according to the power of Xk-i, etc.). The validity of <1> can then be 
resolved based on the signs of the polynomials appearing in it. □ 



A Ramsey-type result with i?-growing sequences. The next idea in in our approach 
is that every sufficiently long sequence a "contains" a long i?-growing sequence, with a suitable 
meaning of "contains". 

Of course, not all long sequences contain i?-growing subsequences; at the very least, we also 
have to consider reversals of i?-growing sequences. Perhaps less obviously, we have sequences 
like 

A,A + 1,A + 2,...,A + N, 

where ^ is a number larger than A^, which contain neither fast-growing subsequences nor 
their reversals. 

In this case, we can find a translate of an i?-growing sequence, of the form A + hi, A + 
b2, ■ ■ ■ , A + bn, as a subsequence. For a slightly more sophisticated example, we consider the 
sequence 

A+-,A+-,...,A+-, 

with two large parameters A, B. Here we have a subsequence obtained from an i2-growing 
sequence (61, ... , 6„) by the rational transformation x ^ A + ^ . 

A key insight, coming from the thesis of Rosenthal [16], is that it suffices to consider 
rational transformations of simple form and with at most two parameters (like A and B 
above). To state this formally, we introduce the next definition. 

Definition 2.3. A i-parametric transformation is a rational function f = f{x,Xi,...,Xt) 
in t + 1 variables. 

Let b be a sequence of length n, let f be a t-parametric transformation, and let Ai, . . . , At 
be real numbers. We write f(b, Ai, . . . , At) for the sequence whose ith term is f{bi, Ai, . . . , At) 
for all i = 1,2, ... ,n. 

If a is a sequence of length N, we say that b as above has a t-parametric embedding into 
a via / if, for some Ai, . . . , At £ B., f{b, Ai, . . . , At) is a subsequence of a. 

Finally, if J- is a set of t-parametric transformations, we say that b has a t-parametric 
J-'-embedding into a if it has a t-parametric embedding into a via some f £ J^. 

Now we can state a key Ramsey-type result. Let Tq stand for the set of the following two 
2-parametric transformations: 

fi{x,X,Y):=X + Yx, /2(x,A,y) := A+|. 
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Proposition 2.4. For every n and R there exists N such that for every sequence a of length 
N there is an R-growing sequence b of length n such that b or rev(6), the reversal ofb, has a 
2-parametric Tq- embedding into a. Quantitatively, it suffices to take 

N -.= 2^ , 

provided that R> Rq for a sufficiently large constant Rq. 

The proof is presented in Section 4 below. A similar result, without the quantitative 
bound and stated in a different language, is implicit in Rosenthal [16] . 

3 Deciding predicates on transformed i?-growing sequences 

Next, we would like to generalize Observation 2.2 from an ii- growing sequence to a sequence 
c obtained from an i?-growing sequence b (or its reversal) by a 2-parametric transformation. 

Thus, let c = f{b, A, B), for some 2-parametric transformation f{x, X, Y) and some A,Bg 
M. For a given polynomial p{xi , . . . , x^) , we would like to understand the sign of p(cj^ , . . . , Cj^. ) . 

Let us substitute xt = f{yi,X,Y) into p{xi, . . . , Xk), and write the resulting rational 
function as the quotient of two polynomials (in yi, . . . ,yk, X,Y). We consider the sign of 
each them separately. 

Thus, let q{yi, ■ ■ ■ , yk, X, Y) be one of these two polynomials; we write it as a polynomial 
in yi, . . . , whose coefficients are polynomials in X, Y: 

q{yi, ...,yk,X,Y) = Y, Qa{X, Y)y'^' • • • , 

where A = A(q) is the set of all multiindices a = {ai, . . . ,ak) such that the coefficient 
qa{X, Y) of y"^ • • • y^'" is not identically zero. 

Unlike in Observation 2.2, here we cannot in general determine the sign of g(6i, . . . ,bk,A, B) 
just from the knowledge of the polynomial q and from the fact that b is i?-growing, since we 
do not know the signs and order of magnitude of the coefficients qoi{A, B). 

Let us define, for a, /3 S A, 

_ qa{A,B) 
-q^{A,B) 

(where, for simpler notation, we put pa/s '■= oo if qj3{A,B) = 0). We observe if each is 
either "not too large" (considerably smaller than bi) or "very large" (much larger than 6^), 
then the sign of q{bi, . . . ,bk,A, B) can again be determined. The following definition captures 
this condition. 

Definition 3.1. Let q = q{yi, ■ ■ ■ , yk, X, Y) be a polynomial and let A,BgM. We say that an 
R-growing sequence b = {bi, . . . , 6„) is i2-well-placed w.r.t. q,A,B if, for every a,{3 £ A(g), 
either 

• Pap is dwarfed by b, meaning that \pai3\ < bi/R, or 

• PafS is gigantic for b, meaning that \ p^p | > 6^ • 
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In this situation we define the type of q w.r.t. A,B, and b as the pair (cr, r), where a: A — )• 
{ — 1, 0, +1} is given by cr(a) := sgn qa{A, B), and r : — )• {D, G} is given by 



We also want to extend these notions considerations from a single polynomial g to a 
collection of polynomials coming from a predicate (or set of predicates). 

Let $ be a (finite) set of semialgebraic predicates and let / = f{xi, . . . ,Xk, X,Y) be a 
2-parametric transformation. For every polynomial p{xi, . . . , Xk) occurring in some predicate 
of $ we proceed as above, i.e., we perform the substitution Xi = f{yi, . . . ,yk, X,Y), and we 
express the resulting rational function as a quotient of two polynomials in yi, . . . ,yk, X,Y . 
Let Q = Q{^, f) stand for the collection of all the polynomials obtained in this way. 

Then we will talk about an i?-growing sequence b being R-well-placed w.r.t. Q, A, B (mean- 
ing that it is i2- well-placed w.r.t. q,A,B for every q G Q), and about the type of Q w.r.t. 
A, B, and b (this is the |(5|-tuple ((ug, Tq) : q G Q), where (ug, r^) is the type of q w.r.t. A, B, 



Next, we extend our Ramsey-type result (Proposition 2.4) so that it yields well-placed 
sequences. 

Corollary 3.2. Let J^q be as in Proposition 2.4, and let ^ be a set of semialgebraic predicates. 
Then there is a constant Ci = Ci($) such that for every n,R there exists 



such that for every sequence a of length N there is an R-growing sequence b of length n such 
that f{b,A,B) or f{T:ev(b), A, B) is a subsequence of a for some f £ J^q and A,B and 
moreover, b is R-well-placed w.r.t. Q{^, f), A, B . 

Proof. Let us take n' := C(n + 2), where C is a sufficiently large number depending on $ and 
J-'q, and use Proposition 2.4 with n' instead of n. Given a sufficiently long sequence a, we 
find an i?-growing sequence b of length n' as in the conclusion of Proposition 2.4, such that 
f{b, A, B) or /(rev(6), A, B) is a subsequence of o. 

Now b is not necessarily i2-well-placed w.r.t. Q{^,f), A,B, and so we consider all the 
values Pa/3 as in Definition 3.1, generated from all the polynomials in Q(^,f). These divide 
the real axis into intervals, whose number can be assumed to be at most C. Among these 
intervals, we fix one containing at least n' /C = n + 2 terms of b. We take the (contiguous) 
subsequence of b contained in this interval, and we delete the first and last terms. The 
remaining n-term subsequence is i?-well-placed w.r.t. Q{^, f)^A, B. □ 

Here is an analog of Observation 2.2. 

Lemma 3.3. Let ^ be a predicate, let b be an R-growing sequence, and let c = f{b, A, B) 
for some 2-parametric transformation f . Suppose that b is R-well-placed w.r.t. Q{^, /), A, B, 
and that R is sufficiently large in terms of $ and f. Then $ holds either everywhere on 
c or nowhere on it, and these possibilities can be distinguished based on ^, f, and the type 
of Q{^, f) w.r.t. A,B,b (without knowing A,B and b). A similar statement holds for the 
validity of ^ on rev(c). 




D if Pai3 is dwarfed by b, 
G if Pap is gigantic for b. 



and b). 



N <2 
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Sketch of proof. It suffices to check that ii q = q{yi, . . . ,yk, X,Y) £ Q($, /) is a single poly- 
nomial, then sgn q{bi-^, . . . jbi^^, A, B) the same for all choices of ii < • • • < (and it can be 
deduced from the type of q). 

The type of q w.r.t. A, B,b gives us the signs of the terms qa{A, B)b'^^ • • ■ b'^^ , a G ^(q)- 
We just need to check that whenever a,f3 G ^(q), a 7^ the absolute values of the 
terms qa{A, B)b^^ ■ ■ ■ and qp{A, B)b^^ ■ ■ ■ b^^ have different orders of magnitude. If one of 
qa{A,B) and qp^A^B) is 0, we are done. If they are both nonzero and paji is gigantic, the 
first term wins; if p^a is gigantic, the second term wins; and if both p^p and pap are dwarfed, 
then the comparison is lexicographic according to a and /3, as in Observation 2.2. □ 

Proof of Proposition 1.6. This proposition is an immediate consequence of Corollary 3.2 and 
Lemma 3.3. □ 

Proof of Theorem 1.4- Let k be the maximum arity of a predicate in and let no := ES#(A;) 
(this is well defined since $ is Erdos-Szekeres). 

Let us consider n > uq, let N := 2^*^^" be as in Corollary 3.2, and let a be an arbitrary 
sequence of length A^. Corollary 3.2 yields an i?-growing sequence b of length n such that, 
for some / S J-q and A, B £ c := f{b,A,B) or c := f{Tev{b), A, B) is a subsequence of 
a. Moreover, b is i?-well-placed w.r.t. Q{^, f), A, B. Then by Lemma 3.3, each $ E $ holds 
either everywhere on c or nowhere on c. 

Since n > hq = ES$(fc), the sequence c contains at least one fc-tuple on which some 
$ G $ holds, and hence this $ holds everywhere on c. This shows that ES$(n) < 2^^^" for 
all n > riQ. Since R depends only on this gives the desired bound ES^(n) < 2^"^*"' for all 
n > riQ, and the finitely many n < hq can be taken care of by setting C sufficiently large. □ 

The way we obtained the existence of riQ in the proof above was nonconstructive. Alter- 
natively, one can obtain an explicit dependence of uq on $ using the idea in Section 6. 

4 Proof of the Ramsey-type result for sequences 

We begin with a quantitative version of the "dichotomy lemma" of Rosenthal [16]. Let us say 
that a sequence b= (6i, . . . ,bn) satisfies the doubling differences condition, or the DDC for 
short, if \bk — bi\ > 2\bj — bi\ holds for every i < j < k (or, in our previous terminology, the 
predicate "|x3 — xi\ > 2\x2 — xi|" holds everywhere on 6). We note that the differences in a 
sequence b satisfying the DDC grow (at least) exponentially, e.g., — 6i| > 2^~^\b2 — bi\. 



increasing sequence a of real numbers of length N has a subsequence b of length n such that 
one of the two (increasing) sequences b and rev(— 6) (where —b stands for (—61, . . . , —bn) ) 
satisfies the DDC. 

Proof. Before proceeding, let us note that if we are not interested in the quantitative bound, 
the existence of a suitable follows easily from Ramsey's theorem for triples. Indeed, we 
observe that, assuming bi < bj < bk, the DDC b^ — bi > 2{bj — bi) is equivalent to b^ — bj > bj — 
bi. Given an increasing sequence (01,02, . . . ,OAr), we color a triple {i, j, k} C [N], i < j < k, 
red if o^ — aj > Uj — Oj, and blue otherwise. Now a red homogeneous subset corresponds 
to a subsequence b satisfying the DDC, and a blue homogeneous subset corresponds to a 



Lemma 4.1. Let n be a natural 




Then every (strictly) 
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subsequence b with rev(6) satisfying the DDC. The latter sequence is decreasing rather than 
increasing, but we can repair this by considering rev(— 6), since negation preserves the DDC. 

Now we present the proof giving a better quantitative bound; it resembles one of the 
well-known proofs of the Ramsey theorem for graphs. Let us define N{k,i) as the smallest 
N such that every increasing sequence a of length N contains a subsequence b of length k 
satisfying the DDC, or a subsequence b of length I with rev(6) satisfying the DDC. 

We have the initial conditions A^(2, i) = 2 and N{k, 2) = 2, and below we will derive the 
recurrence 

N{k,£) < N{k-1,£) + N{k,£-1) - 1, k,£> 3. 

It is well known, and easy to check, that this implies N{k,£) < (^^^), and so the bound in 
the lemma follows. 

To verify the recurrence, let := N(k — 1,£) + N{k,£—1) — 1 and let a be a nondecreasing 
sequence of length A^. We divide the interval [ai, cat] into the left and right subinterval by the 
midpoint ^(^i + o-n), and we let a' and a" be the parts of a in the left and right subinterval, 
respectively. Then lena' > N{k - 1,£) or lena" > N{k,£- 1). 

In the former case, if a' contains a subsequence b of length £ with rev(6) satisfying the 
DDC, we are done. Otherwise, a' has a subsequence b' of length k — 1 satisfying the DDC, 
and it is easy to see that the sequence obtained by appending to 6' also satisfies the DDC. 
The case of lena" > N{k,£ — 1) is analogous. This concludes the proof. □ 

Rather than using Lemma 4.1 directly, we will apply the following simple consequence, 
where the DDC is strengthened to R-fold expansion of the differences. 

Corollary 4.2. Let n be a natural number, let R> 1, and set r := [log2 i?] . Then there is 
N < 4:'^^"-^^'l+^ such that every (strictly) increasing sequence a of real numbers of length N has 
a subsequence b of length n such that the predicate "x^ — xi > R{x2 — xi)" holds everywhere 
on one of the two (increasing) sequences b and rev(— 6). 

Proof. Select a subsequence b' of a of length r(n — 1) + 1 as in Lemma 4.1, and define 
:= « = 1,2, . . . ,n. □ 

Proof of Proposition 2.4- Let n and R be given, let be sufficiently large as in the proposi- 
tion, and let a be a sequence of length N. 

First, if some number occurs at least n times in a, we are done, since the 1-parametric 
transformation /(x, X) = X {a special case of both fi and /2 with Y = 0) embeds any 
ra-term sequence into a. Otherwise, we may pass to a subsequence of with all terms distinct 
and of length at least N/n. Next, by the Erdos-Szekeres lemma, we can further pass to a 
subsequence a", with len(a") > N/n, that is either increasing or decreasing. We consider 
only the increasing case, the decreasing one being symmetric. 

Next, applying Corollary 4.2, we obtain a subsequence a^^^ of a", where lena" < ^riiena^^i-i, 
such that "x3 — xi> R{x2 — xi)" holds everywhere on a^^^ or rev(— a^'^-'). Let us again discuss 
only the former case. 

In this case we form a new sequence b^^^ of length lena^^^ — 1 by setting := a[^-^ — a[^^ 
We note that 6^^^ satisfies b^^ > and > Rb[^^ , and it embeds into a via the 1-parametric 

(3) 

transformation f{x, X) = x + X for X \= a\ . 
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(3) 

Now we apply Corollary 4.2 again, this time to the sequence I with li = log 6^ , and this 
yields a subsequence 6*^^^ of , such that 




holds for every i < j < k, or a similar relation holds for rev(l/6^^^). As expected, we again deal 
explicitly only with the first possibility, and we set c-^^ := i = 1,2, . . . , len(6^'^^) — 1. 

By this choice, we have c^^-^^ > (cf^)^. Moreover, since 1^"^-^^ > Rl>f^ for all i, we also 
obtain cf^ = 62^'' Z^^^"* > R, and so (^'^^ is ii-growing. It is also clear that c^^^ embeds into 6^^^ 
via the 1-parametric transformation g{x, X) := x/X with X := b^i \ and hence it embeds into 
a via the 2-parametric transformation h{x,X,Y) = f{g{x,X),Y). Thus, c^^-* is our desired 
sequence (called b in the proposition) . 

By following the chain of length estimates backwards, we get that 

AT > „42(K4'-("-i)+^-l)+2) 

suffices for the whole argument. Assuming that R exceeds a suitable constant (not very large, 
actually) , a series of rough estimates shows that the right-hand side can be bounded by 2^^" 
as claimed. □ 



5 The decision algorithm 

Before formulating the algorithm for deciding whether a given set of predicates $ is Erdos- 
Szekeres, we introduce some additional terminology. 

li q = (7(2/1, . . . ,yk-,X,Y) is a polynomial, a candidate type for g is a pair (cr, r) of arbi- 
trary functions o": A — )■ { — 1,0,-1-1} and r: A^ — )• {D,G}, where A is as introduced above 
in Definition 3.1. A candidate type for a collection Q of polynomials (all in the variables 
yi, . . . ,yk,X,Y) is an arbitrary sequence T = {{oq,Tq) : q G Q), where each {aq,Tq) is a 
candidate type for q. 

Definition 5.1. We call a candidate type T for a collection Q of polynomials feasible if for 
every R and every n there exist A, i? G M and an R-growing sequence b of length n that is 
R-well-placed w.r.t. Q, such that the type of Q w.r.t. A,B, andb equals T. Any such sequence 
b is called a feasible i?-growing sequence for Q and T. 

We note that our definition of feasibility does not use any particular value of R, but has 
the quantification "for all R\ Accordingly, we will never use an explicit value of R in our 
algorithm. 

In Section 6 below, we will provide a subroutine that tests feasibility of a given candidate 
type for a given collection Q of polynomials. Now we present the main algorithm, in which 
we use this feasibility testing as a black box. 



Algorithm for testing if $ is Erdos Szekeres. 

Input: A finite set $ of semialgebraic predicates. 
Output: YES if $ is Erdos-Szekeres, NO otherwise. 
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1. Let be the set of two 2-parametric transformations as in Proposition 2.4. For each 
/ let Q{^,f) be the collection of polynomials obtained from $ (as introduced after 
Definition 3.1). Perform the following steps for every f £ J^q and every candidate type 
T for Q{^, /). If all these steps are completed without returning NO, return YES and 
finish. 

2. Test the feasibility of T (as specified in Section 6 below). If T is feasible, continue with 
the next step; otherwise, continue at Step 1 with the next T or next /. 

3. Determine, by the method of Lemma 3.3, whether all for $ G hold everywhere 
on f{b, A, B) or on /(rev(6), A, 5), where 6 is a feasible i?-growing sequence for Q{^, f) 
and T (here A, B are the corresponding parameter values, which we need not determine 
explicitly; similarly, R is only assumed to be sufficiently large). If they do, return NO 
and finish the whole algorithm. If not, continue at Step 1 with the next T or next /. 

Proof of Theorem 1.1 (assuming the feasibility testing). The algorithm is clearly finite, so it 
suffices to verify that its answer is always correct. With the tools developed above, the proof 
is routine. 

First suppose that the algorithm returns NO; then this is because in Step 3 it has exhibited 
arbitrarily long sequences c on which all <^ G hold everywhere. So $ is certainly not 
Erdos-Szekeres. 

Now suppose that $ is not Erdos-Szekeres; this means that for some no, there exist 
arbitrarily long sequences for which no <I> G $ holds everywhere on any subsequence of length 
riQ. By Ramsey's theorem, this means that there are arbitrarily long sequences a on which 
each of $ G holds everywhere. (Alternatively, we can avoid using Ramsey's theorem 
and argue as in the proof of Theorem 1.4 at the end of Section 3.) For every A^, fix such a 
sequence a^^^ of length N. 

For every n, by Corollary 3.2, there is = N{n) such that for some n-growing n-term se- 
quence 0^ ' , some G Jo, and some 

^W^^W g M, one of the sequences 
and/(rev(6(")), A("),fi(")) is a subsequence of a^^^")), andb^") is n-well-placed w.r.t. Q(^>, /(")), 
Let r(") be the type of Q(*,/(")) w.r.t. and M"). 

Since J-q is finite and there are finitely many types, there is an infinite subsequence 
ni < 712 < ■ ■ ■ such that all the /("^^ are equal to the same / and all the T^"-'^ are equal to 
the same T. But then we get that T, regarded as a candidate type for Q{^, /), is feasible, 
and Step 3 returns NO for this / and T. This concludes the proof. □ 

6 Testing the feasibility of a candidate type 

The problem of testing feasibility of a given candidate type for a collection of polynomials 
can be recast in the following (slightly more general) terms. We are given a finite collection 
Q of bivariate polynomials (the ga(A, Y) in the setting of Definition 5.1), and for each q £ Q, 
a sign (Tg G {— 1,0,+1} is specified. Moreover, we are given two finite collections T> and G 
of rational functions in the variables X,Y (in the setting of Definition 5.1, P consists of the 
Pai3 that should be dwarfed by the desired feasible sequence, while Q are the that should 
be gigantic for it). The numerators and denominators of the rational functions in D and in 
Q belong to Q, and thus their signs are under control. Moreover, we may assume that the 
prescribed signs for all the denominators are nonzero. 
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The question then is whether we can make a large enough gap between the values of the 
functions in D and those in Q so that an n-term i?-growing sequence fits there. Formally, we 
thus ask for the validity of the formula 

^ := Vn € n yR3L, H : {L > R) A {H > L^"^^ ) A E{L , H) , 

where 

E{L,H) := 3X,Y:(^/\sgnq{X,Y)=ag) 

qeQ 

a(/\ piX,Y) < ^) A (/\ piX,Y) > Hy 
P&v peg 

We would like to test the validity of ^ using a decision algorithm for the first-order theory 
of real-closed fields (the first such algorithm is due to Tarski [18], and we refer, e.g., to [2] 
to more recent ones). While E{L,H) can easily be rewritten to a first-order formula in the 
theory of the reals (since sgnq{X,Y) = aq is expressed as a polynomial inequality, and the 
inequalities p{X, Y) < L ox p{X, Y) > H can be multiplied by the denominator, since we 
know its sign), ^ is not such a formula: quantification over the natural numbers, and more 
importantly, the exponential function, do not belong to the first-order theory of the reals. 

To remedy this, we replace ^ by the formula 

^* := Vi? 3L:{L>R)A Vi/ E{L, H). 

The validity of ^ can be tested by the decision algorithms mentioned above, and so for 
completing our subroutine for feasibility testing, it suffices to prove the following. 

Lemma 6.1. The formulas ^ and ^* are equivalent. 

Proof. Clearly ^* implies ^. For the reverse implication, let us consider the function 

h{L) ■.= sup{H : E{L,H)]. 

Supposing that ^* does not hold, we see that there arbitrarily large values of L such that 
h{L) is finite. Since /i is a semialgebraic function, it has to be finite for all L > Lq for some 
Lq. But if ^ did hold, then for every m there are arbitrarily large values of L with h{L) > L*", 
while it is easy to check, and well known, that a semialgebraic function may have at most a 
polynomial growth. Hence \I' does not hold and the lemma is proved. □ 



7 A lower bound for the Ramsey function 

Proof of Proposition 1.5. We recall that the cross-ratio of an ordered 4-tuple (zi, . . . , Z4) of 
real numbers is defined as 

, ^ {Zl- Z3){Z2 - Za) 

[zi, Z2; Z3, Z4) := -. 

{Z2 - Z3){Zi - Z4) 

As is well known, the cross-ratio is invariant under projective transforms of the form x 1— )■ 
{ax + b)/{cx + d) (assuming ad — be / 0), and in particular, it is invariant under the two 
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transformations X,Y) = X + Yx and 72(2;, X^Y) = X + Y/x of (unless 1" = 0, that 
is). 

We define the set of predicates $ = {<I>i, <I>2, ^*3} as follows: 

^>l(xi,X2) := Xi=X2, 

^>2(xi,...,X5) := DISTINCT(xi,...,X5) A (xi,X2;x3,X4)2 >4 

A (xi,X2;X3,X5)^ > (xi,X2;X3,X4)'*, 
4>3(X1, . . . ,X5) := ^2{x^,Xi,...,Xi). 

Here DISTINCT(xi, . . . , X5) has the expected meaning. The condition (xi, X2; X3, X4)^ > 4 
should be read as |(xi, X2; X3, X4)| > 2, but it is written in squared form, so that it can 
be expressed as a polynomial inequality and we need not worry about the sign. A similar 
comment applies to the last condition of ^2- 

First we check that $ is Erdos-Szekeres. Thus, let n be given, let i? be a suitable constant, 
and let a be a sufficiently long sequence. By Proposition 2.4, there is an n-term ii-growing 
sequence h such that h or rev (6) has a 2-parametric embedding in o via J-q. 

It is easy to check that <I>2 holds everywhere on any i2-growing sequence with a sufficiently 
large constant R (intuitively it is clear that the cross-ratios have to grow fast, and formally it 
can be checked as in Observation 2.2), and consequently, <I>3 holds everywhere on the reversal 
of an i?-growing sequence. 

Since ^2 and <I>3 are expressed in terms of cross-ratios, and these are invariant under 
the transformations of J-q, we get that if b J-Q-embeds into a, then either the image of b is 
a constant sequence, or then ^>2 holds everywhere on the image of b. Similarly, if rev(6) 
Jxi-enibeds into a, then either the corresponding subsequence of a is constant, or <I>3 holds 
everywhere on it. Thus, $ is indeed Erdos-Szekeres. 

Next, we want to prove a doubly exponential lower bound for ES^(n). To this end, let 
a = (1, 2, ... , A^), and let c be an n-term subsequence on which one of ^>2, <&3 holds everywhere 
{^i is out since the terms of a are all distinct). By possibly replacing c by its reversal, we may 
assume that $2 holds everywhere on c. Then the cross-ratios in c satisfy | (ci, C2; C3, 04)! > 2 
and |(ci,C2;c3,Cj+i)| > |(ci, C2; C3, Ci)p, and so |(ci, C2; C3, c„)| > 2^" . Since all terms of c 
are integers, at least one of them has to be at least 2^ . □ 

8 Replacing a set of predicates by a single predicate 

Here we consider two simple constructions for replacing a finite set $ = {<I>i, ...,<!>.,•} of d-di- 
mensional A;-ary predicates with a single predicate ^, in such a way that $ is Erdos-Szekeres 
iff $ is. 

In the first construction, we simply set ^> := Vi=i ^i- Then if some holds everywhere 
on some sequence a, then so does Conversely, assuming that ^> holds everywhere on a 
sequence a of length N , we color each fc-tuple Ji < • • • < jk of indices by a color i such that 
$i(ajj, . . . ,ajj,) holds. Then by Ramsey's theorem, some holds everywhere on a suitable 
subsequence of length n, provided that is sufficiently large in terms of n, fe, and r. 

However, for this construction, it might happen than ES|§| is much larger than ES^. Here 
is the second construction, which preserves the Ramsey function but increases the number of 
variables and the complexity of the predicate considerably. 
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Lemma 8.1. Let $ = {^'i, . . . , ^r} be a set of k-ary predicates. Then there is a predicate ^ 
in rk variables such that {^} is Erdos-Szekeres iff $ is, and if they are Erdds-Szekeres, then 
ESj(n) = ES$(n) for all n > rk. 

Proof. We first define (rA:)-ary predicates ^'j, i = 1, 2, . . . , r, where ^'i(xi, . . . , x^fc) expresses 
that $j holds everywhere on the sequence (xi,X2, . . . ,^rk)- Explicitly, 

^i(xi,...,Xrfc) := l\ $»(xj,,...,xjj. 

l<ji<---<ife<rfc 

Then we set $(xi, . . . ,Xrfc) := Vi=i "^ii^i, ■ ■ -^^rk)- 

Clearly, if some holds everywhere on a sequence a, then so does Conversely, suppose 
that ^ holds everywhere on a sequence a of length n > rk; we claim that then some $i holds 
everywhere on a as well. 

If it were not the case, we fix, for every i = 1,2, ... ,r, a fc-tuple Jj C [n] such that $i 
does not hold on the corresponding k terms of a. We consider the union (JI=i a-Ta.d, if it 
has fewer than rk elements, we add the appropriate number of other elements of [n] (chosen 
arbitrarily) to it, obtaining an rfe-element set J. Then $ does not hold on the subsequence 
of a indexed by J; the resulting contradiction proves the lemma. □ 

9 Algebraic predicates, and the multipartite setting 

In this section we prove the effective compactness for d-dimensional algebraic predicates (The- 
orem 1.8), as well as the effective compactness for semialgebraic predicates in the multipartite 
setting (Theorem 1.9). We begin with the multipartite setting, since the proof is formally 
somewhat simpler. 

We will repeatedly use the following straightforward fact: for every set P C M[xi, . . . ,xt] 
of i-variate polynomials, each of degree at most D, there is a subset Po P at most 
(^i"*) polynomials that defines the same variety in M* as P, where the variety defined by 
P is V{P) = {x G M* : p(x) = for all p G P}. Indeed, the vector space of all t-variate 
polynomials of degree at most D has dimension ('^^*) , with the set of all monomials forming 
a basis, and if we choose Pq as a basis of the subspace generated by P, then we have V{Po) = 
V{P). We will refer to this fact as the bounded- dimension argument. 

Proof of Theorem 1 . 9. Throughout the proof, by saying that a certain quantity is bounded 
we mean that it can be bounded from above by some explicit function of d (the space dimen- 
sion), k (the arity of the predicate), and D (the maximum degree of the polynomials in the 
predicate) . 

To explain the idea of the proof in a simpler setting, we first assume that $ is algebraic 
and binary, i.e., k = 2. 

We need to prove only the implication (i)=^(ii). So we assume that <I>(xi,X2) holds 
everywhere on Ai x A2, where \Ai\ = \A2\ = N is large but so far unspecified. Our goal is to 
construct infinite sets Xi , X2 ^ M"^ such that ^ holds everywhere on Xi x X2 ■ 

By the assumptions of the theorem, $ is a conjunction of polynomial equations of degree 
at most D, and by the bounded-dimension argument, we can assume that the number of 
equations is bounded. 

The plan is the following: We are going to define two semialgebraic sets (and actually, 
algebraic varieties) ^1,25^2,1 Q such that Ai C Vi^2, ^2 ^ ^2,1, and <I> is easily checked 
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to hold on Vi^2 x ^2,1. We will also show that Vi^2 and V2,i can be defined by bounded-size 
formulas (i.e., the formula is a Boolean combination of at most m polynomial equations and 
inequalities, each of degree at most D, where m and D are bounded). Then we will use a 
result stating that the number of connected components of a semialgebraic set definable by 
a bounded-size formula is bounded (a specific result of this kind is a theorem of Warren [21, 
Theorem 2], based on the well-known Oleinik-Petrovskii-Milnor-Thom theorem [14, 11, 19] 
on the sum of Betti numbers of a real algebraic variety; also see [2, Chap. 7]). Finally, if 

= 1 is larger than the number of components of Vi,2) then two points of Ai must be 
connected by a path, and hence Vi^2 is infinite (and similarly for V2,i). 

It remains to define Vi^2 and V2,i and to verify the claimed properties. Let Fi = Fi(xi) be 
the formula Aa2eA2 ^(^11^2), and let Vi C W'- be the variety {xi G R'^ : Fi(xi)}. We define 
-^2(x2) := AaisAi ^(^i)X;2) and V2 similarly. We have C VJ, i = 1, 2, by the assumption. 

We set 

Vi,2 := {xi G Vi : (Vx2 G V2) ^'(xi,X2)} 

and 

V2A := {x2 G V2 : (Vxi G Fi) ^>(xi, X2)}. 

By definition, $ holds everywhere on Vi^2 x ^2, as well as on Vi x V2,i. Since Vi,2 Q Vi and 
^2,1 ^ 1^2) we get that $ holds everywhere on Vi^2 x V2,i- 

It is easy to see that Ai C Vi^2 and A2 ^ V2,i, and we now want to argue that Vi^2 and 
V2,i can be defined by bounded-size formulas. 

First, we Vi is the variety defined by the polynomials /(xi,a2), where / is one of the 
polynomials in $ and a2 G ^2- By the bounded-dimension argument, only a bounded number 
of these polynomials suffice to define Vi, and hence we can obtain a bounded-size formula Fi 
that is equivalent to Fi. Similarly, from F2 we obtain a bounded-size equivalent formula F2. 

Now we consider the formula -Fi,2(xi) defining Vi^2- By definition, 

Fi,2(xi) := Fi(xi) A Vx2(F2(x2) ^ $(xi,X2)). 

We now replace Fi and F2 with Fi and F2, which gives a bounded-size formula describing 
Vi^2) and then we apply quantifier elimination to obtain a quantifier- free formula Fi^2(xi) that 
is equivalent to Fi^2(xi) and thus also defines Vi^2- Moreover, by the properties of available 
quantifier-elimination methods for the first-order theory of the reals (see [2, Sec. 11.3]), the 
size of Fi^2 is still bounded. Hence we can bound the number of connected components of 
Vi,2 as in the plan above, and the considered special case of Theorem 1.9 is proved. 

Next, we still assume that $ = $(xi, . . . ,Xfc) is algebraic, but it can be A;-ary for any k. 
The basic idea is the same as before, but this time we define a more complicated sequence of 
sets Vi^j, with i G [k] and J ^ [/c] \ {z}, inductively by 

Vifi := {xi G M"* : (Vai G ^1) • • • (Va^^i G Ai_i)(Vai+i G ^i+i) • • • (Va^ G Ak) 

$(ai, . . . ,ai_i,Xi,ai+i, . . . ,aA;)| 

and, for J / 0, 

Vi,j := {x, G fl Vij> : (Vx,- G Vjj\{j},j G J)(Va,- G Aj,j G [k] \ {i} \ J) 
J'cJ 

$(xi;xj, j G J;aj,j G [/c] \ {i} \ J)}. 



17 



where the sequence of arguments of ^> means that the ith argument equals Xj, the jth argument 
equals Xj for all j S J, and the jth argument equals all j G [k] \ {i} \ J. 

By definition, for J' C J we have Vi^j C Vt^j/, and this then shows that $ holds everywhere 
on X • • • X Vfc Inductively it is also easy to show that Ai C Vij for all J. 

It remains to show that each Vi^j can be described by a bounded-size formula, possibly 
involving quantifiers (then we apply quantifier elimination and bound the number of connected 
components as in the case k = 2). We are going to show this by induction on | J|. 

Thus, the inductive hypothesis is that we have bounded-size formulas Fj^j(xj), possibly 
with quantifiers, describing Vij, for all J up to some size s, and we want to get such a formula 
for J of size s + 1. 

By the definition, we have the following formula defining Vi^j: 

Fi,j(xi) = ( A hj'i^ 



A(Vxj,j G J) 



where Gij{xi;Xj, j G J) is the formula asserting that <I>(xj;xj,j G J',st.j,j £ [^] \ \ {*}) 
holds for all aj £ Aj, j G [^] \ \ Now Gi^j can be replaced with a bounded-size formula 
by the bounded-dimension argument, and this yields the desired bounded-size formula Fij, 
finishing the induction step. We have proved the desired result for algebraic predicates. 

Finally, we let <I) = <I>(x 1, . . . , x„) be an arbitrary semialgebraic predicate. We may assume 
it to be of the form $ = $iV$2V---V where each $j is a conjunction of polynomial 
equations and strict inequalities. Given large sets Ai, . . . , Af. such that $ holds on yli x • • • x 
Ak, by the /c-partite Ramsey theorem we get that some holds everywhere on vl'^^ x • • • x A'f^, 
where the A'- C Ai are still large. 

Thus, we assume that $ = $= A where ^>= is a conjunction of polynomial equations 
and is a conjunction of strict polynomial inequalities, and that $ holds everywhere on 
Ai X ■ ■ ■ X Ak- 

By the argument above for the A:-partite algebraic case, we obtain curves Yi D Ai, . . . ,Yk D 
Ak such that <!>= holds on Yi x • • • x Y^. In particular, there are ai £ Ai, i £ [k], such that 
each aj is a cluster point of Yi. Since <!>< holds at (ai, . . . , a^), it also holds for all (yi, . . . , y^) 
from a small neighborhood of (ai, . . . ,afc). Thus, we can choose infinite sets C 1^ such 
that $ holds everywhere on Xi x ■ ■ ■ x Xk as desired. Theorem 1.9 is proved. □ 

As was mentioned in the introduction, the analog of Theorem 1.9 also holds for algebraic 
predicates over the complex numbers. To see this, only two ingredients in the proof above 
need to be changed: first, quantifier elimination for the first-order theory of the reals needs 
to be replaced by quantifier elimination for the first-order theory of algebraically closed fields 
(see [15] for a recent work and references), and second, we bound the number of connected 
components of a variety in C* by regarding it as a semialgebraic set in R-^* and applying the 
same bounds as in the proof above. 

Algebraic predicates on sequences. Theorem 1.8 follows easily from the next two 
lemmas. The first lemma is very similar to Theorem 1.9, but the sets Ai, . . . ,Ak in (i), as 
well as the sets Xi , . . . , X^. in (ii) , are all equal. 
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Lemma 9.1. In the setting of Theorem 1.9, if a d- dimensional k-ary semialgebraic predicate 
involving polynomials of degree at most D, is assumed to hold everywhere on A'' = A x 
■ ■ ■ X A, where \A\ = N = N{d, k, D), then there is an infinite X such that $ holds everywhere 
on X^. 

Proof. First we assume algebraic. We consider the "symmetrization" <I?sym of <I>: 

(xi, . . . ,Xfc) := l\ $(x^(i), . . . ,x^(fc)) 

(the conjunction is over all permutations of [k]). Then 'I>sym also holds everywhere on A'^ . 
When we define the sets Vi^j for <l>sym as in the proof of Theorem 1.9 above, we have that 
$sym; and hence holds everywhere on HiLi By the symmetry of $sym) the Vi,j 

satisfy = Vi^j for every permutation tt, and in particular, the varieties ^^^[fcj^j are 

all equal. Denoting their common value by V , we get that $ holds everywhere on . The 
infinitude of V follows by bounding the number of components as above. 

If $ is semialgebraic, then we again proceed as in the proof of Theorem 1.9, observing 
that the choice of Xi, . . . , Xj. can also be done symmetrically, i.e., with Xi = • • • = X^. □ 

The second lemma shows that for algebraic predicates, we can pass from "holding every- 
where on a long sequence" to "holding everywhere on a large Cartesian power" . 

Lemma 9.2. For every d,k,D,n there exists N (for which an explicit bound can be given) 
with the following property. Assuming that a d-dimensional k-ary algebraic predicate involving 
polynomials of degree at most D holds everywhere on a sequence a of length N , with all terms 
distinct, then there is an n-element set B C a such that $ holds everywhere on (here the 
inclusion B C a means that B consists of some of the terms of the sequence a). 

Proof. The proof is again based on the bounded-dimension argument. Let us set b*-''^ := a, 
and let us assume inductively that b'--'^^^ is a subsequence of a of length uj^i such that 
$(bj^, . . . ,bjj,) holds for all choices of ii, ■ ■ . ,ij-i G [^^j-i] and 1 < < ij+i < ■ ■ ■ < ik ^ 

For i = 1, 2, . . . , Uj-i, let 

Wl'^ := {(xi, . . . , X,) G {M.'^y ■■ {yij+i, ..■,ik,i< ij+i < ■ ■ ■ < ik) 
$(xi,...,Xj,bi^.^^,...,bij|. 

The inductive assumption gives (bj^, . . . ,bi^. ) G W-'''^ for all G and all 

ij G [i]. 

We have C W2 ^ • • • , and by the bounded-dimension argument, there are only a 

(i) 

bounded number of distinct varieties among the W-^ . Thus, there exist imin < ima.x, with 

(i) (i) 

«max - imin large, such that W>-" = W>-" . 

''mm '■max 

For ah ii, . . . G [uj-i], we have (bj^, . . . ,bj^) G W^^^^^ = w/^^^ for aU ij G [imin + 

Ij imax]i and by the definition of W^^^^^, we obtain that $(bj^, . . . , bj^.) holds for alHi, . . . , ij G 
Pmin + 1, imax] and all ij+i, . . . ,ik with imin < ij'+i < " " " < ifc- Hence we can let b(j) be the 
subsequence of b^-'"^-' indexed by [zmin + l,imax]- This finishes the induction step. 

The proof is concluded by letting B be the set of elements of b^'^^ □ 
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